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IMPROVED VERSION OP THE SPLIT ROUTINE FOR CLASSY 


Introduction 

The SPLIT routine originally Implemented for CLASSY was 
Intended to be a quickly written, temporary routine to get the 
system running. In fact, another version had been designed and 
coded earlier, but was Judgrd -oo difficult to debug quickly. 

The original version of SPLIT attempted to minimize a 
quadratic form of the difference between the observed skewness 
and kurtosls and the skewness and kurtosls produced by the 
model. The model was then adjusted by a kind of steepest des- 
cents approach until further adjustment would not prove practi- 
cal. This frequently requires 50 or more Iterations, which can 
consume a fairly large amount of machine time. In fact, with 
other speedups of CLASSY being Implemented by Eloglc , SPLIT 
could well become the primary consumer of machine time. 

In addition, analyses done since the original SPLIT was 
encoded Indicate that It may not give the best solution for 
common situations. Thus a new SPLIT routine would have to pro- 
duce a better set of solutions than the original version. 

For both of the above reasons, Eloglc designed a new 
SPLIT routine which Is being Implemented by Lockheed. 
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Calculation Procedure 

We wish to find approximate values for the parameters 
(mean, covariance, and proportion) of two distributions, given 
the same parameters plus the traces of skewness and kurtosis 
for the mixture distribution. The calculation is done in a 
coordinate frame where the mixture covarlanct» is the unit 
matrix, which allows us to easily calculate inner products 
with respect to that covariance, etc. 

The two clusters to be found are defined by 

cluster 

proportion a 0 

mean V v 

covariance o* t* 

We are given S (the skewness vector 
Input kurtosis matrix K^, which is immediately used to derive 
the practical kurtosis matrix, K*K^-(d+2)£, where d Is the 
dimensionality of the space. 

The Initial variables are obtained as In the present 
SPLIT routine, and are used to set up the new clusters as In 
that routine. The reader Is referred to the documentation for 
SPLIT Version I for the details of these calculations. In 
addition the rotation to the frame with unit covariance and the 
Initialization of the iterations is done as in Version I. 
Version II differs from Version I only in the form of the 
iterative step and convergence calculation, and in the handling 
of certain special conditions. 
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We wish to solve the equations 

66^ + 0 * + ^5^ X* ■ 1 (covariance— -a symmetric 

^ matrix, 1 means the 

unit matrix) 

(la) 

(2D*6 + 6 trD*-yfi*6) ■ S (skewness — a vector) 

(lb) 

(2D" + D* trD* + ( 3 .] fg . “ - ^ .) 6*5 . (Ic) 


2y + fi (D*6) J - 

where 

-l£Y<l is the mixing parameter 

6 is the displacement vector between the clusters (”y-v) 

(2a) 

D* * a*-T* (2b) 

The unknowns are o*, t*, 6, and y* 

This system has one more unknown than equations, requir- 
ing an additional assumption. The simplest such assumption is 
Y»0; other possibilities are trD*»0, etc. We will use y“0 
(equal weight clusters), here, but some huerlstic allowance 
must be made at the end of the calculation to avoid wild solu- 
tions. Some checking of solutions must be done In any event to 
avoid problems caused by multiple clusterr, etc. 


y 6*D* - Y trD*6 6)^ » K 

(kurtosls, (d+2)Z" 
removed; a symmetric 
matrix) 
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The equations to be solved become 

^ (2D*6 + 6 trD*) - S (3a) 

^ (2D** ♦ D*trD* - ««^) - K (where «* - (3b) 

which are to be solved for 6 and D^. (Note that and appear 
here only via D*.) The definition of D*, along with the equation 

^ 66^ + j 0 * + I T* - 1, (3c) 

can be treated separately and later than (3a) and (3b) except for 
the positivity constraint on o* and t* . This requirement Is 
handled as a special constraint during the solution of the system 
(2b) and (3c); If no solution Is possible « the error solution Is 
taken. 

Under normal circumstances, the matrix K will have exactly 
one negative eigenvalue. If K has no negative eigenvalues, 
this Indicates that the two hypothesized clusters differ primar- 
ily In covariance (a condition which might reasonably be flagged). 
A situation where K has multiple negative eigenvalues can be 
contrived with only two clusters, but this generally Indicates 
multiple clusters, requiring a more even division of the space 
for later additional SPLITtlng. 

Additional Conditions 

Besides the actual numeric solution, the following special 
conditions must be observed: 



1) If there are excess negative eigenvalues below a 
certain threshold, probably Indicating multiple subclusters, 
the "basic” (initial) solu^:lon la used. A message condition 
Is also raised. 

2) If the solution discovered has covariance matrices 
which would divide the space too unevenly, then the "basic” 
solution is also used, with a message condition. 

3) The "basic" solution and a third message condition 
are used If the numerical iteration falls to converj?r 3 . 

The "basic" solution Is a hueristically calculated solu- 
tion to the equations which is used as the Initial solution and 
as a last resort solution in the case of errors. It Is In fact 
the Initial solution for the original SPLIT routine. 
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Iteration Pi^ocedure for Verelon II of SPLIT 

Write AS - I (2D*6 + « trD*) - S <<ia) 

AK » I (20“ + D* trD* - I 6*36^/ - K (Mb) 

for the solution errors. (These will be 0 If 6 and are 
correct.) The Initial values are determined by the "basic” 
procedure, described above. !nie Iterations are done in a 
frame where D* is constantly forced to be a diagonal matrix— 
that Is, if D* Is corrected by an off-dlafonal term, the coordi- 
nate frame Is rotated to make be diagonal again. (This 
requires rotating D^, 3, K, and S.) 

Thus the frame In which the problem is stated rotates from 
Iteration to iteration, overall from one in which K is diagonal 
to one in which the final solution is diagonal. These 
rotations must be stored In a separate rotation matrix which 
gives the net rotation between the transformed frame iuid the 
original one. When making the rotations, 3 and are both 
rotated Incrementally to the new frame, and S and K are rotated 
from their original values to the new frame by the accumulated 
transformation before AS and AK are calculated. 

The variables are separated Into two groups: 

1) 3 and the diagonal elements of D* 

2) the off-diagonal elements of 


7 


The variables of Group 1 are modified during iteration 
using a matrix version of Newton's method; those in Group 2 
are modified by direct substitution (Aitken iteration)* 
inq^lemented by frame rotation. 

Iteration on 6 and the diagonal elements of ; 

We take AS and the diagonals of AK and use them to cor- 
rect 3 and the diagonals of via 



(2dx2d mtx) 


Since this is a Newton's method technique, it is subject 
to bad overshoots, etc. Therefore, a test must be made that the 
various error terms AS and diag AK decrease due to the iteration, 
if they do not, the increment given to 3 and diag can be 
decreased. (For small enough increment, they are guaranteed to 
decrease.) This test may be applied to the whole iteration, not 
Just the Newton's method part; this would save the need to make 
duplicate calculations of AS and AK. In the worst case of non- 
convergence, the "basic" solution could be taken, with the rais- 
ing of an error flag. 
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The TarlouB derivatives are: 


»«i .1 



«l^trD* 


(5a) Is a Kroneoker delta function, 

•1 if i*k, else 0; not to be con- 
fused with 6^ the vector, which 

has 1 or no subscripts, instead 
of 2) 


a 


dAS 


L, - 

ki 


I * 


l®ki 


* T *k*u 


+ I « 


l*k'. 


(5b) 


dAK 


77 


ii 


dAX 


U 


aD> 


ki 


f *l‘k«4 - «lk«4«* - F «4k*l«‘ <«'■{ «l‘l> 


♦ 


? lj*kt 


(5<J) 


Although these equations include the off-diagonal terms 
of AK and D^, only the diagonal terms are used In the Newton's 
method matrix. 

Off diagonal elements 

The off diagonal elements of D’ Are adjusted ii\sing direct 
substitution. The off diagonal ecmqxments of AK are used to 
determine the change in D* required. The amount of change 
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required can be determined from 



Note that D^ la diagonal (In the current coordinate frame) bo 
that either l»k and J*A or !■! and J*k for the derivative to be 
non-zero. By symmetry conditions on dK and we may assume 
l>j and k>A, l*k and so that we calculate 



llie two-part calculation of 6 and Is basically a Newton's 
method calculation with the derivative terms between the parte. 


treated as 0 
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Pinal Solution 

After D* and 6 have been calculated, the program must 
check that the variances o^ and are satl8f8;.&ury. This Is 
done via the relations 

D* ■ c* - T* 

1 » 6^ + j o* + j X* 

and may be calculated from these, and checked for posi- 
tive definiteness. Actually, o^ and must be held fairly far 
away from zero eigenvalues and must not have too large or small 
a volume ratio. The actual threshold for these tests is under 
the control of parameters. If these tests fail, we revert to 
the ”baslc" solution (this is error 2 above) . 

After checking a* and , the system must be rotated back 
to the original frame of reference by the inverse of the accu- 
mulated transforms of D*. These back-trans formed 6, o*, and t* 
can then be used to generate the pair of new clusters. 


SPLIT VERSION 2 PLOW CHART 
(Includes subroutine names) 
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Summary 

This report has developed the theory for a new. Improved 
version of the SPLIT routine. This version Is Intended to 
replace the old version which was put together quickly, without 
using any fundamental understanding of the problem. 
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